Top-k best probability queries and semantics ranking properties on probabilistic databases
Authors
Abstract
There has been much interest in answering top-k queries on probabilistic data in applications such as market analysis, personalised services, and decision making. In probabilistic relational databases, the most common problem in answering top-k queries (ranking queries) is selecting the top-k results based on both scores and top-k probabilities. In this paper, we first propose novel answers to top-k best probability queries by selecting the probabilistic tuples that have not only the best top-k scores but also the best top-k probabilities. We introduce an efficient algorithm for top-k best probability queries that does not require users to define a threshold. The top-k best probability approach is more efficient and effective than the probability threshold approach (PT-k) [1, 2]. Second, we add the “k-best ranking score” to the set of semantic properties for ranking queries on uncertain data proposed by [3, 4]. We then analyse our proposed method and show that it meets the semantic ranking properties on uncertain data. In addition, we show that the answers to the top-k best probability queries overcome the drawbacks of previous definitions of top-k queries on probabilistic data in terms of semantic ranking properties. Lastly, we conduct an extensive experimental study, using both real and synthetic data sets, that verifies the effectiveness of answers to the top-k best probability queries compared to PT-k queries on uncertain data, and the efficiency of our algorithm against the state-of-the-art execution of the PT-k algorithm. © 2011 Published by Elsevier Ltd.
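To make the notions of "top-k probability" and the PT-k baseline in the abstract concrete, here is a minimal Python sketch. It is not the paper's algorithm: it assumes a simple uncertain relation in which tuples exist independently with a given membership probability and have distinct scores, and the names `topk_probabilities` and `pt_k` are hypothetical. Under those assumptions, a tuple is in the top-k of a possible world when it exists and at most k-1 higher-scored tuples exist, which can be computed with a small dynamic program.

```python
from typing import Dict, List, Tuple

# Each uncertain tuple is (id, score, membership probability).
# Assumption: tuples exist independently and scores are distinct.
UTuple = Tuple[str, float, float]


def topk_probabilities(tuples: List[UTuple], k: int) -> Dict[str, float]:
    """Return, for every tuple, the probability that it is among the k
    highest-scored tuples of a randomly drawn possible world (k >= 1)."""
    ranked = sorted(tuples, key=lambda t: t[1], reverse=True)
    # dist[j] = probability that exactly j higher-scored tuples exist (j < k).
    dist = [1.0] + [0.0] * (k - 1)
    result: Dict[str, float] = {}
    for tid, _score, p in ranked:
        # The tuple is in the top-k iff it exists and fewer than k
        # higher-scored tuples exist.
        result[tid] = p * sum(dist)
        # Fold this tuple into the distribution for the lower-ranked ones.
        updated = [0.0] * k
        for j in range(k):
            updated[j] = dist[j] * (1.0 - p)
            if j > 0:
                updated[j] += dist[j - 1] * p
        dist = updated
    return result


def pt_k(tuples: List[UTuple], k: int, threshold: float) -> List[str]:
    """Threshold-style answer: ids whose top-k probability is at least
    the user-supplied threshold, in descending score order."""
    probs = topk_probabilities(tuples, k)
    return [tid for tid, pr in probs.items() if pr >= threshold]


if __name__ == "__main__":
    data = [("a", 90.0, 0.5), ("b", 80.0, 0.8), ("c", 70.0, 0.6)]
    print(topk_probabilities(data, k=2))
    print(pt_k(data, k=2, threshold=0.4))
```

The `threshold` argument of `pt_k` is exactly the user-defined parameter that the abstract says the top-k best probability approach avoids; how the paper selects answers without it is not specified in this abstract and is not reproduced here.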
Related resources
Top-k Best Probability Queries on Probabilistic Data
There has been much interest in answering top-k queries on probabilistic data in various applications such as market analysis, personalised services, and decision making. In relation to probabilistic data, the most common problem in answering top-k queries is selecting the semantics of results according to their scores and top-k probabilities. In this paper, we propose a novel top-k best probab...
Robust Ranking of Uncertain Data
Numerous real-life applications are continually generating huge amounts of uncertain data (e.g., sensor or RFID readings). As a result, top-k queries that return only the k most promising probabilistic tuples become an important means to monitor and analyze such data. These “top” tuples should have both high scores in terms of some ranking function, and high occurrence probability. The previous ...
A Domination in the Probabilistic World: Computing Skylines for Arbitrary Correlations and Ranking Semantics
In a probabilistic database, deciding whether a tuple u is better than another tuple v does not have a univocal solution; rather, it depends on the specific probabilistic ranking semantics (PRS) one wants to adopt to combine tuples’ scores and probabilities. In deterministic databases it is known that skyline queries are a remarkable alternative to (top-k) ranking queries, because they remov...
Ranking queries on uncertain data pdf
Top-k queries, also known as ranking queries, are often natural and useful in. Ing probabilistic threshold top-k queries on uncertain data. UNCERTAIN DATA MODELS W.R.T RANKING QUERIES. Uncertain attribute based on the associated discrete pdf and the choice is. observed, the semantics of top-k queries on uncertain data can be ambiguous due to tradeoffs. Whether it is better to report highly ranked i...
Ph.D. Dissertation Proposal Probabilities and Sets in Preference Querying
User preferences in databases are attracting increasing interest with the boom of information systems and the trend of personalization. In the literature, there are two different frameworks on this topic, namely quantitative approaches and qualitative approaches. The former assumes the availability of a scoring function, while the latter does not. Instead, in qualitative approaches, preferences...
Journal title:
- Data Knowl. Eng.
Volume 88, Issue
Pages -
Publication date 2013